Bayesian Conditional Density Filtering for Big Data
نویسندگان
چکیده
We propose a Conditional Density Filtering (C-DF) algorithm for efficient online Bayesian inference. C-DF adapts Gibbs sampling to the online setting, sampling from approximations to conditional posterior distributions obtained by tracking of surrogate conditional sufficient statistics as new data arrive. This tracking eliminates the need to store or process the entire data set simultaneously. We show that C-DF samples converge to the exact posterior distribution asymptotically, as sampling proceeds and more data arrive over time. We provide several motivating examples, and consider an application to compressed factor regression for streaming data, illustrating competitive performance with batch algorithms that use all of the data.
منابع مشابه
Bayesian Conditional Density Filtering
We propose a Conditional Density Filtering (C-DF) algorithm for efficient online Bayesian inference. C-DF adapts MCMC sampling to the online setting, sampling from approximations to conditional posterior distributions obtained by propagating surrogate conditional sufficient statistics (a function of data and parameter estimates) as new data arrive. These quantities eliminate the need to store o...
متن کاملUnited Statistical Algorithms, Small and Big Data, Future of Statistician
Role of big idea statisticians in future of Big Data Science. United Statistical Algorithms framework for comprehensive unification of traditional and novel statistical methods for modeling Small Data and Big Data, especially mixed data (discrete, continuous). Goal: Model (X, Y ) by nonparametrically estimating conditional mean E[Y |X = x] and conditional quantile Q(u;Y |X = x). Modeling exampl...
متن کاملBayesian Prediction Intervals under Bivariate Truncated Generalized Cauchy Distribution
Ateya and Madhagi (2011) introduced a multivariate form of truncated generalized Cauchy distribution (TGCD), which introduced by Ateya and Al-Hussaini (2007). The multivariate version of (TGCD) is denoted by (MVTGCD). Among the features of this form are that subvectors and conditional subvectors of random vectors, distributed according to this distribution, have the same form of distribution ...
متن کاملGeneralised Filtering
We describe a Bayesian filtering scheme for nonlinear state-space models in continuous time. This scheme is called Generalised Filtering and furnishes posterior conditional densities on hidden states and unknown parameters generating observed data. Crucially, the scheme operates online, assimilating data to optimize the conditional density on time-varying states and time-invariant parameters. I...
متن کاملVariational filtering
This note presents a simple Bayesian filtering scheme, using variational calculus, for inference on the hidden states of dynamic systems. Variational filtering is a stochastic scheme that propagates particles over a changing variational energy landscape, such that their sample density approximates the conditional density of hidden and states and inputs. The key innovation, on which variational ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/1401.3632 شماره
صفحات -
تاریخ انتشار 2014